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ADMINISTRATIVE  INFORMATION 


I . 


The  research  project  reviewed  in  this  report  was  supported 
by  the  Decision  Sciences  Laboratory,  Electronic  Systems  Division, 

U.  5.  Air  Force,  under  Contract  No.  AF  19 ( 628 )-l6l0 .  The  project 
began  in  the  summer  of  196?  and  terminated  in  1969.  Dr.  Emir  H. 

Shu. ford  servec  as  principal  investigator  during  the  earlier  portion 
of  the  project;  Dr.  Albert  H.  Amon  served  as  principal  Investigator 
during  the  latter  portion.  Other  participants  in  the  project  have 
included  Dr.  Masanao  Toda,  Mr.  William  H.  Wynn,  Mr.  Amnon  Rapoport, 
Mr.  David  Messick,  Mr.  Edward  Massengill,  Mr.  Raymond  Wiesen,  and 
Mr.  Larry  Gordon.  Dr.  Lyle  V.  Jones  has  provided  advisory  assis¬ 
tance  throughout  the  course  of  the  research  program.  This  final 
report  has  been  prepared  by  Mr.  William  Wynn. 
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II.  GENERAL  REVIEW  OP  RESEARCH  OBJECTIVES  AND  FINDINGS 


Most  of  the  research  undertaken  In  this  project  attempted  to 
show  how  Bayesian  decision  theory  might  be  used  as  a  descriptive 
theory  of  human  information  processing.  There  are  several  reasons 
for  using  decision  theory  as  a  starting  point  for  a  theory  of  human 
behavior.  The  first  reason  is  that  a  decision-theoretical  analysis 
of  a  task  yields  an  upper  limit  on  the  performance  possible  in  the 
task  as  presently  structured.  This  upper  limit  of  performance  is 
relevant  in  evaluating  the  efficiency  of  human  operators  and  the 
adequacy  ol  tiaining  programs,  as  well  as  guiding  the  development 
of  man-machine  systems  which  yield  maximal  performance.  Another 
reason  for  using  decision  theory  as  a  descriptive  theory  of  human 
behavior  derives  from  the  fact  that  it  is  a  model  which  represents 
optimally  efficient  behavior.  It  usually  proves  an  easy  task  to 
modify  this  model  to  correspond  to  the  apparent  relative  inefficiency 
of  behavior  by  introducing  certain  constraints  such  as  a  limited 
and  imperfect  memory ,  infoimation  processing  errors,  "cost  of  think¬ 
ing,"  etc.  The  validity  of  these  modifications  can  be  empirically 
determined . 

There  are  a  number  of  possible  decision-theoretic  formulations. 
They  all  share  In  common  the  notion  of  a  payoff  matrix  In  which 
a)  the  rows  represent  the  alternative  courses  of  action  available 
to  the  decision  maker,  b)  the  columns  represent  either  the  alterna¬ 
tive  courses  of  action  available  to  the  opponent  in  game  theory  or 
the  possible  states  of  nature  in  decision  making  under  uncertainty, 
i.e.,  statistics  broadly  interpreted  (Luce  &  Raiffa,  1957;  Raiffa 
&  Schlaiffer,  1961),  and  c)  the  entries  in  the  cells  of  the  matrix 
represent  the  utilities  associated  with  the  possible  outcomes.  The 
utilities  are  measured  on  a  psychological  continuum  defined  up  co 
a  linear  transformation  (von  Neumann  &  Morgenstern,  19^7;  Luce  & 
Raiffa,  1957;  Shuford,  Jones,  &  Bock,  1961). 

The  decision-theoretic  formulations  differ,  however,  with  re¬ 
spect  to  the  criterion  for  an  optimal  strategy  (Luce  &  Raiffa,  1957). 
It  often  has  been  assumed  that  the  minimax  strategy — i.e.,  choose 
that  course  of  action  which  results  In  maximum  utility  assuming  the 
worst  possible  outcome--is  reasonable  at  least  for  the  case  in 
which  one  is  playing  a  strictly  competitive  game  against  a  rational 
opponent.  This  view  is  subject  to  the  criticism  that  even  in  a 
competitive  situation  one  should  take  account  of  the  past  behavior 
of  the  opponent  and  all  other  possible  relevant  Information,  a 
policy  which  is  in  general  incompatible  with  the  minimax  strategy, 
but  which  is,  In  fact,  an  Intuitive  restatement  of  a  Bayes  strat¬ 
egy — i.e.,  use  all  available  information  to  assign  (subjective  or 
personal)  probabilities  to  the  opponent's  actions  or  to  the  states 
of  nature,  use  these  probabilities  to  compute  the  expected  utility 
for  each  alternative  course  of  action,  and  then  select  that  course 
of  action  which  has  the  largest  expected  utility. 
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It  seems  apparent  that  Bayesian  decision  theory  is  more  power¬ 
ful  and  more  general  than  the  minimax  strategy  because  it  includes 
an  additional  psychological  variable,  subjective  probability.  Also, 
with  the  development  of  a  theory  of  subjective  probability  (Shuford, 
1959;  Wiesen  &  Shuford,  1961)  which  can  account  for  the  modification 
of  probabilities  on  the  basis  of  past  experience  and  new  information, 
we  have  a  descriptive  decision-theoretic  model  which  is  dynamic  in 
the  sense  of  accounting  for  changes  in  behavior  over  time.  Further, 
the  model  describes  and  can  be  used  to  define  adaptive  behavior. 

In  developing  and  testing  Bayesian  models  for  individual  de¬ 
cision  making,  the  subject's  task  in  an  experiment  is  conceived  to 
be  a  problem  in  decision  making  under  uncertainty  in  which  the  sub¬ 
ject  responds  on  each  trial  of  the  experiment  in  such  a  way  as  to 
maximize  his  average  payoff  during  the  course  of  the  experiment. 

Two  sources  of  Information  are  available  to  the  subject:  a)  the 
information  obtained  from  observing  the  particular  stimulus  presen¬ 
ted  on  that  trial,  and  b)  the  background  information  as  to  the 
probabilities  with  which  the  different  stimulus  values  might  be 
presented.  This  background  information  is  provided  by  the  subject's 
experience  before  participating  in  the  experiment,  the  instructions 
and  the  context  of  the  experiment,  and  during  the  course  of  the 
expei'iment  by  the  stimuli  previously  observed  by  the  subject.  The 
background  or  prior  information  is  available  at  the  beginning  of 
the  trial.  The  subject  observes  the  stimulus  and  obtains  (Imper¬ 
fect)  Information  as  to  the  value  of  the  stimulus  on  this  particular 
trial.  The  subject,  in  effect,  combines  this  specific  information 
with  his  prior  informatic.i  and  obtains  a  posterior  probability  dis¬ 
tribution  (representing  the  subjective  probability  of  the  stimulus 
value  given  the  observation)  which  is  used  to  determine  the  expec¬ 
ted  utilities  of  the  different  response  categories  and,  thus,  the 
optimum  response.  The  models  use  Bayes'  theorem  In  the  computation 
of  the  posterior  probabilities  and  in  many  cases  are  formally  analo¬ 
gous  to  the  models  used  in  Bayesian  statistical  inference  (Schlaifer, 
1959;  Schlaifer,  1961;  Ralffa  &  Schlaifer,  1961;  Shuford,  1961b). 

By  assuming  that  man  is  a.  Bayesian  information  processor, 
several  predictions  were  derived  relating  probability  of  correct 
response,  amount  of  information  requested,  and  decision  time  to  two 
Independent  variables  of  interest — number  of  response  categories 
and  disparity  between  response  alternatives.  It  was  shown  that 
these  relations  depend  upon  other  aspects  of  the  decision  task  such 
as  the  size  of  the  payof fs ,  the  cost  of  information,  the  probabili¬ 
ties  of  the  states,  and  the  conditional  probabilities  of  the  data. 

During  the  early  portion  of  the  project  a  number  of  signifi¬ 
cant  theoretical  developments  occurred.  One  line  of  development 
carried  on  by  Dr.  Masanao  Toda  and  by  Mr.  David  Messick  was  con¬ 
cerned  with  the  measurement  of  subjective  probability  distribu¬ 
tions  (see  Working  Papers  Nos.  1,  2,  3,  6,  and  7).  Another  line 
of  development  carried  on  by  Dr.  Emir  H.  Shuford  and  by  Mr.  Raymond 
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A.  Wiesen  was  concerned  with  optimal  models  for  learning  prior 
probabilities  (see  Working  Papers  Nos.  5,  8,  10,  and  18).  Dr. 
Masanao  Toda  compared  Bayesian  and  minimax  strategies  in  zero-sum 
two-person  games  (see  Working  Paper  No.  9)*  Mr.  David  Messick 
applied  concepts  from  Bayesian  decision  theory  and  the  theory  of 
non-zero-sum  games  to  the  social  psychological  problem  of  group 
decision  making  and  group  problem  solving  (Messick,  1963).  Dr. 

Emir  Shuford  showed  how  it  was  possible  to  take  a  well-defined 
decision  situation  (one  in  which  all  distribution  functions  and 
parameters  are  known)  and  successively  degrade  the  situation  down 
to  a  point  where  no  decision  problem  remains,  by  progressively 
relaxing  the  assumptions  about  the  nature  of  the  decision  situa¬ 
tion  (see  Working  Papers  Nos.  15,  17,  19,  21,  and  31). 

An  early  experiment  performed  by  Mr.  Raymond  Wiesen  applied 
decision  theory  in  an  investigation  of  some  probabilistic  learning 
situations  (Wiesen,  1962).  Three  related  choice  situations  were 
presented.  The  first  situation  was  a  standard  probabilistic  dis¬ 
crimination  learning  task.  Each  trial  began  with  the  presentation 
of  one  of  a  set  of  stimuli.  The  subject  chose  between  the  two 
response  alternatives  to  predict  which  of  two  events  would  occur 
on  the  trial,  the  probability  of  each  event  being  a  function  of 
the  stimulus  presented.  In  the  second  situation,  the  probabilities 
of  the  stimuli  given  the  events,  i.e.,  the  conditional  probabili¬ 
ties,  were  introduced  to  the  subject  at  the  beginning  of  the  ex¬ 
periment.  The  third  situation  was  like  the  second  except  that  the 
subject  was  not  told  which  event  occurred  on  each  trial.  The 
decision-theoretic  analysis  showed  what  differences  in  performance 
would  be  expected  among  the  three  conditions  when  a  strategy  which 
maximizes  average  expected  payoff  is  employed.  One  group  of  sub¬ 
jects  was  run  in  each  situation  with  the  overall  relative  frequency 
of  one  event  equal  to  .80.  The  performance  of  subjects  in  the  first 
and  second  situations  was  virtually  identical,  indicating  that  sub¬ 
jects  In  the  second  group  did  not  integrate  information  concerning 
the  overall  relative  frequencies  of  events  and  conditional  proba¬ 
bilities.  Subjects  In  the  third  situation  showed  poorer  per¬ 
formance  than  those  in  the  other  two  groups,  with  a  large  proportion 
of  these  ( non- feedback)  subjects  making  every  choice  in  agreement 
with  the  assumption  that  the  overall  relative  frequency  of  one  event 
was  one-half. 

As  part  of  the  program  for  the  experimental  evaluation  of  pre¬ 
dictions  from  the  Bayesian  model,  two  computer-controlled  decision 
tasks  were  developed.  The  first  task,  programmed  for  the  LGP-30 
computer  of  the  Psychometric  Laboratory  by  Mr.  Amnon  Rapoport  (see 
Working  Papers  Nos.  11  and  26)  was  a  computerized  version  of  a 
task  (see  Shuford,  Wiesen,  Massengill,  and  Nishisato,  "Pre-deci- 
sional  Processes  Related  to  Psychophysical  Judgment"  Report  No.  33, 
Psychometric  Laboratory,  January  1963  for  description  of  original 
task)  requiring  estimation  of  proportions.  In  this  task  the  indl- 
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viduai  observes  a  sample  of  elements  from  a  population  and,  on  the 
basis  of  this  Incomplete  information,  ^"tl, nates  the  percentage  of 
elements  of  a  given  type  in  the  population.  The  Individual  Is  paiu 
a  fixed  amount  for  estimating  the  actual  percentages  within  -.r;e  or 
two  percentage  points.  Since  the  distribution  of  actual  percentage 
occurring  during  the  experiment  may  deviate  markedly  from  the  uni¬ 
form,  a  Bayesian  decision  maker  would  improve  his  performance  by 
learning  this  distribution  and  combining  this  information  with  the 
sample  Information  to  obtain  a  better  estimate.  Further,  the  dis¬ 
tribution  cf  actual  percentages  may  change  one  or  more  times  uuring 
the  course  of  an  experiment.  Thus  the  Bayesian  decision  maker  must 
be  able  to  detect  these  changes  and  realize  that  he  Is  operating  In 
a  different  environment.  This  computer-controlled  task  was  used  to 
study  how  humans  learn  prior  probabilities  and  how  they  deal  with 
this  particular  type  of  nonstationary  environment. 

In  one  such  study  utilizing  this  computer-controlled  task, 

Mr.  Amnon  Rapoport  (1963)  obtained  three  types  of  measures  of  each 
subject,  from  which  the  prior  (subjective)  probability  distribution 
could  be  inferred  for  each  block  of  ten  trials.  The  stimuli  In 
the  experiment  consisted  of  a  series  of  3  x  3  matrices  composed  of 
randomly  placed  0's  and  l's.  Every  3x3  matrix  was  a  sample  from 
a  larger  10  x  10  matrix  containing  a  given  proportion  of  l's,  this 
proportion  being  randomly  selected  from  a  given  beta  distribution. 
The  experiment  consisted  of  120  trials,  and  the  environment  was 
nonstationary,  for  different  beta  distributions  were  used  in  gen¬ 
erating  the  stimuli  for  trials  1-50,  trials  50-^0,  and  trials  9I-12C 
subjects  not  being  told  that  these  environmental  shifts  would  occur. 
On  each  trial  the  subject  first  made  a  prediction  of  the  proportion 
of  l's  in  the  large  unobservable  10  x  10  matrix.  Then  he  was  pre¬ 
sented  with  the  3x3  matrix,  from  which  he  made  an  estimate  of  the 
proportion  of  l's  In  the  larger  matrix.  After  each  block  of  ten 
trials,  direct  measures  of  the  subjective  probability  distribution 
(SPD)  were  obtained  from  the  subject's  responses  to  three  hypo¬ 
thetical  bets  concerning  the  value  of  the  next  proportion  that  the 
computer  might  select.  After  making  their  estimates  of  the  pro¬ 
portion,  subjects  were  informed  of  the  correctness  of  their  esti¬ 
mates  and  predictions  on  that  trial  and  of  their  cumulative 
earnings,  subjects  thus  being  provided  with  complete  feedback  on 
each  trial. 

Several  findings  were  noted  in  the  experiment,  as  follows: 

1)  as  predicted  from  the  Bayesian  model,  the  subjects'  predictions 
tended  to  approach  the  mode  of  the  stimulus  distributions,  and  the 
variance  among  subjects  decreased  as  more  information  was  obtained; 

2)  subjects  tended  to  overestimate  the  variance  of  the  stimulus 
distribution,  indicating  that  the  SPD's  were  somewhat  flatter  than 
the  actual  stimulus  distributions;  3)  correspondence  among  the 
three  measures — i.e.,  predictions,  estimates,  and  responses  to 
hypothetical  bets — averaged  over  subjects,  was  good,  though  far 
from  perfect;  ^)  though  the  beta  distribution  from  which  the 
stimuli  were  generated  changed  twice  during  the  experiment,  the 
second  shift  was  responded  to  much  more  rapidly  than  the  first, 
showing  that  subjects  became  sensitive  to  the  nonstat i onari ty  cf 
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the  environment.  The  Bayesian  model  was  found  to  be  incomplete  in 
that  it  predicted  only  the  asymptotic  performance  of  the  subjects, 
having  nothing  to  say  concerning  the  learning  which  leads  to  this 
asymptotic  state.  An  extension  of  the  Bayesian  model  was  formula¬ 
ted  with  the  intention  of  accounting  for  the  trial-to-trial  changes 
in  the  predictions  and  estimates  of  individual  subjects.  This  model 
had  two  free  parameters,  concerning  individual  differences,  one 
for  weight  given  to  past  experience  and  another  for  the  weight 
given  to  the  successive  most  recent  observations.  When  values  of 
these  parameters  providing  the  best  fit  for  average  predictions  and 
estimates  of  the  subjects  were  used,  the  fit  was  remarkably  good. 

A  second  study  utilizing  this  same  computer-controlled  task 
(Rapoport,  1964)  had  as  its  main  purpose  the  testing  of  a  method 
of  measuring  the  parameters  of  a  continuous  SPD,  obtained  at  several 
points  within  a  sequential  decision  task.  This  method,  called  the 
Range  Betting  Method,  was  developed  by  Toda  (see  Working  Papers 
Nos.  1,  2,  3,  and  6  and  also  Toda,  1963).  Subjects  made  predic¬ 
tions  and  estimates  of  proportions  for  120  trials  in  a  nonstation¬ 
ary  environment,  as  before.  This  experiment  differed  from  the 
above  in  that  the  stimulus  samples  consisted  of  a  varying  number 
of  l’s  and  0*s,  the  number  of  elements  being  randomly  determined, 
between  1  and  20,  on  each  trial.  Subjects  were  divided  into  two 
groups,  one  group  receiving  complete  feedback  (Group  CF),  and  the 
second  group  receiving  only  partial  feedback  (Group  PF) .  Subjects 
in  the  CF  condition  were  informed  on  each  trial  as  to  correctness 
of  their  estimates,  the  correct  value  for  the  proportion  being 
printed  by  the  Flexowriter.  Subjects  in  the  PF  condition  received 
no  information  after  each  trial,  but  were  informed  of  their  earnings 
only  after  each  block  of  five  trials,  thus  being  given  only  a 
general  idea  as  to  performance  level  without  knowledge  of  whether 
their  gains  were  a  function  of  predictions  or  estimates  or  on  which 
trials  they  were  correct.  After  each  block  of  five  trials,  subjects 
in  both  groups  made  predictions  concerning  the  range  within  which 
the  next  selected  proportion  would  fall.  In  comparing  the  predic¬ 
tions  of  subjects  in  this  experiment  with  those  in  the  previous, 
it  was  found  that  subjects  in  the  CF  condition  gave  approximately 
the  same  values  as  those  in  the  previous  study,  whereas  subjects 
in  the  PF  condition  gave  predictions  significantly  different  from 
those  in  the  previous  study,  showing  almost  no  evidence  of  learning 
and  with  their  predictions  tending  to  concentrate  near  .5.  As  in 
the  previous  experiment,  the  inferred  SPD,  averaged  over  subjects, 
was  flatter  than  the  actual  distribution.  The  subjects*  estimates 
of  the  range  boundaries  were  compared  to  predicted  values  by  the 
range  betting  method  and  also  to  predicted  values  by  an  alterna¬ 
tive  confidence  limits’  model.  The  range  betting  method  was  found 
to  be  adequate  for  measuring  the  parameters  of  a  continuous  SPD  so 
long  as  the  assumption  is  made  that  the  SFD  is  a  beta  distribution. 
As  this  assumption  is  questionable,  the  alternative  conflden  e 
limits  model,  which  also  takes  account  of  Individual  difference 
parameter  concerning  the  recency  of  information  used  in  making 
the  range  estimates,  was  considered  a  more  realistic  psychological 
mode  1 . 
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A  second  computer-controlled  decision  task  devised  for  experi¬ 
mental  testing  of  the  proposed  models  was  that  programmed  by  Mr. 
Raymond  Wiesen  (see  Working  Paper  No.  12).  In  this  task,  the  de¬ 
cision  maker  must  choose  one  from  a  small  number  of  alternatives. 

A  fixed  payoff  is  received  for  the  choice  of  the  correct  alterna¬ 
tive.  The  decision  maker  may  request  information  to  aid  in  making 
the  decision,  but  the  more  information  requested  the  greater  is 
the  cost  to  the  decision  maker.  The  task  may  be  set  up  in  two 
modes.  In  the  fixed  sampling  mode,  the  decision  maker  states  the 
number  of  observations  desired,  this  information  is  provided,  and 
the  decision  maker  must  then  make  his  decision.  In  the  sequential 
sampling  mode,  the  decision  maker  may  request  one  observation,  ob¬ 
serve  the  result,  decide  to  take  another  observation  or  to  stop 
and  make  his  terminal  decision,  and  so  on.  This  task  is  arranged 
so  that  cost  of  information,  payoffs,  number  of  alternatives, 
symmetry  or  asymmetry  of  alternatives,  prior  probabilities,  and 
sampling  mode  may  be  varied  during  an  experimental  session  according 
to  any  prearranged  schedule.  The  dependent  variables  of  interest 
are  amount  of  information  requested,  earnings,  objective  expected 
payoff  of  actual  choice,  and  objective  probability  of  correctness 
of  actual  choice. 

An  experiment  utilizing  this  computer-controlled  task  was 
that  of  Mr.  David  Messick  (1964a).  This  experiment  in  sequential 
information  seeking  involved  two  Independent  variables:  number 
of  terminal  acts  or  response  alternatives  and  information  available 
to  the  decision  maker  prior  to  beginning  the  task.  According  to 
Bayesian  decision  theory,  the  larger  the  number  of  possible  termi¬ 
nal  acts,  the  more  pre-decisional  information  is  needed;  and  the 
more  prior  information  available  about  which  of  the  terminal  acts 
is  correct,  the  less  current  information  will  be  needed  to  be  ob¬ 
served  prior  to  making  the  decision.  On  the  basis  of  samples 
from  a  binomial  distribution,  subjects  were  to  estimate  the  pro¬ 
portion  of  l's,  or  "top  quality"  items.  Subjects  sampled  the  uni¬ 
verse  sequentially,  stopping  after  some  numoer  of  observations 
had  been  taken  in  order  tc  make  a  terminal  decision  by  selecting 
that  one  of  the  mutually  exclusive  and  exhaustive  subsets  of  the 
unit  interval  which  was  believed  to  contain  the  proportion.  The 
number  of  subsets,  or  possible  terminal  acts,  was  either  three 
(.00  to  .33,  .33  to  .67,  or  .67  to  1.00)  or  five  (.00  to  .20,  .20 
to  .40,  .40  to  .60,  .60  to  .  80,  .80  to  1.00).  Two  prior  frequency 
distributions  were  used--either  a  rectangular  distribution  or  a 
negatively  skewed  distribution  over  the  unit  interval.  Thus  four 
different  conditions  for  the  32  experimental  trials  were  possible, 
defined  by  the  four  combinations  of  the  two  decision  partitions  of 
the  unit  interval  and  the  two  prior  frequency  distributions.  It 
was  found,  as  expected,  that  more  observations  were  taken  in  the 
5-act  case  than  in  the  3-act  case.  No  effect,  however,  of  prior 
frequency  distributions  was  found,  but  there  was  a  tendency  for  sub¬ 
jects  to  make  more  observations  in  the  latter  16  trials  than  in  the 
first  16.  Subjects  did  not  make  full  use  of  the  information  pro¬ 
vided  by  the  prior  distribution,  for  fewer  observations  should  have 
been  required  when  the  prior  distribution  was  skewed,  since  such  a 
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distribution  makes  for  less  stimulus  uncertainty  than  does  the 
rectangular  distribution. 


As  no  formal  model  was  applicable  to  the  problem  presented 
in  this  experiment,  an  optimal  Bayes  strategy  model  was  derived 
so  as  to  permit  a  more  detailed  investigation  of  the  extent  to 
which  human  information  seeking  may  be  accounted  for  in  terms  of 
a  "rational"  theory  of  behavior  (Messick,  1964b).  The  important 
variable  in  determining  a  stopping  rule  in  this  task  was  found  to 
be  the  expected  probability  of  being  correct.  A  "no  information" 
theorem  was  presented  which  shows  that  under  some  circumstances 
when  a  success  or  failure  on  a  given  trial  are  equally  probable, 
the  probability  of  being  correct  after  making  the  observation  is 
identical  to  the  probability  of  being  correct  before  the  observa¬ 
tion  was  taken.  A  derivation  of  the  Beta-binomial  probability 
function  was  given  which  suggested  a  more  tractable  computational 
procedure  for  the  distribution  and  which  illuminates  its  limiting 
distribution . 

During  the  course  of  research  undertaken  as  part  of  this  pro¬ 
ject,  certain  limitations  in  the  Bayesian  approach  to  describing 
human  decision  processes  were  noted.  The  Bayesian  model  was  found 
to  be  incomplete,  in  that  it  predicts  behavior  in  a  situation  in 
which  the  stimulus  distribution  is  known  to  the  subject,  but  it 
has  little  or  nothing  to  say  concerning  the  process  by  which  the 
stimulus  distribution  is  learned.  Bayesian  models  may  assume  un¬ 
limited  memory  on  the  part  of  the  subjects  and  may  give  equal 
weight  to  all  events  observed  without  regard  to  recency;  full  uti¬ 
lization  of  all  observations,  as  well  as  accurate  integration  of 
current  observations  with  prior  information,  is  also  assumed. 

Much  of  the  research  reported  in  this  project  was  directed  toward 
making  extensions  of  the  more  elementary  Bayesian  model  and  taking 
account  of  assumptions  of  the  model  which  do  not  hold  in  human  in¬ 
formation  processing.  A  central  phase  of  the  latter  period  of 
the  project  was  the  development,  by  Dr.  Albert  H.  Amon,  of  a  des¬ 
criptive  syntax  for  general  classes  of  decision  structures  and 
the  utilization  of  this  analysis  in  the  construction  of  models  of 
human  decision  behavior  in  difficult  environments.  This  theo¬ 
retical  development  derived  from  information  processing  theory 
and  from  experiments  in  human  perception,  rather  than  from  the  tra¬ 
dition  of  statistical  decision  theory;  it  follows  up  some  lines 
of  thought  in  his  "Decision  Structures  in  Recognition"  (Amon, 

1962).  Because  of  the  importance  of  errors  and  other  nonoptimal 
behavior  for  efficient  constriction  of  the  class  of  alternative 
models,  decision  making  under  stress  was  emphasized  in  this  under¬ 
taking.  Dr.  Amon  had  made  considerable  progress  in  developing 
several  alternative  models,  and  had  begun  writing  what  would  have 
been  a  monograph  to  be  distributed  unde’-  terms  of  the  contract  at 
the  time  of  his  unexpected  death  on  Au f.  st  9,  1964.  Though  ex¬ 
tensive  notes  for  this  monograph  are  available,  the  material  is 
far  from  being  in  finished  form.  The  tu/.U  of  organizing  and 
editing  this  material  has  been  undertaken,  however,  and  it  is  hoped 
that  a  report  of  Dr.  Amon's  thinking  in  this  area  will  be  published 
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at  some  later  time. 


Other  research  within  the  information  processing  framework 
was  carried  out  by  Mr.  William  Wynn,  toward  construction  of  a 
model  of  certain  perceptual  and  memory  aspects  of  paired-associate 
learning. 

Another  activity  initiated  by  Dr.  Amon  was  the  design  and 
assembly  of  a  computerized  experimental  control  and  recording 
system.  Mr.  Larry  Gordon  and  Mr.  Charles  Lambright ,  who  were 
assisting  Dr.  Amon  in  the  project,  and  Mr.  Larry  Baucom,  are 
making  progress  toward  completion  of  the  system.  When  operational, 
the  system  will  make  possible  single  or  multiple  subject  experi¬ 
mentation,  under  precisely  controlled  conditions,  with  data  acqui¬ 
sition  at  high  rates  recorded  on  magnetic  tape  and  interpretable 
by  the  UNIVAC  1105  of  the  Computation  Center  of  the  University  of 
North  Carolina. 
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III.  ABSTRACTS  OF  TECHNICAL  DOCUMENTARY  REPORTS 


ESD-TDR-64-603 
David  M.  Messlck 

BAYESIAN  DECISION  THEORY,  GAME  THEORY  AND  GROUP  PROBLEM-SOLVING 


This  paper  applies  the  concepts  of  the  "ideal"  information  pro¬ 
cessor  and  rational  decision  maker  to  a  typical  problem  in  social 
psychology — that  of  group  problem  solving.  The  structure  of  the 
selected  task  is  seen  to  be  that  of  a  nonzero  sum  game.  A  strate¬ 
gy  is  derived  and  is  shown  to  be  the  equivalent  of  the  Nash  solu¬ 
tion  to  the  game.  The  notion  of  level  of  aspiration  is  discussed 
and  defined  within  the  analysis  and  two  theorems  are  proved  rela¬ 
ting  level  of  aspiration  to  type  of  group  decision  strategy  employed. 


Project  &  Task:  4690-469003 


Contract  No.  AF  19(628)-1610 
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David  M.  Messick 


SEQUENTIAL  INFORMATION  SEEKING :  AN  OPTIMAL  STRATEGY  AND  OTHER 
RESULTS 


This  paper  presents  an  optimal  strategy  for  sequential  sampling 
from  binomial  distributions.  The  strategy  presented  is  general 
in  that  it  is  a  ’'multi-action'’  rather  than  a  two-acfion  procedure 
While  the  major  task  is  to  estimate  the  proportion,  p,  of  "suc¬ 
cesses"  in  a  hypothetical,  infinite  population  of  binary  obser- 
varions,  it  is  assumed  that  the  decision  maker  is  only  concerned 
with  which  of  a  set  of  mutually  exclusive  and  exhaustive  subsets 
of  the  unit  Interval  contains  p.  The  derived  strategy  maximizes 
the  decision-maker's  gain  without  regard  to  error  probabilities. 

The  Important  variable  in  determining  a  rule  for  ceasing  to 
look  at  new  data  and  making  a  decision  is  found  to  be  the  expec¬ 
ted  probability  of  being  correct.  The  criterion  involves  only 
the  economic  aspects  of  the  situation.  A  "no  information"  theorer 
is  presented  which  shows  that  under  some  circumstances  when  a 
"success"  or  a  "failure"  on  a  given  trial  are  equally  probable, 
the  probability  of  being  correct  after  making  observation  is  iden¬ 
tical  to  the  probability  of  being  correct  before  the  observation 
was  taken.  Finally,  an  appealing  derivation  of  the  Beta-binomial 
probability  function  is  given  which  suggests  a  more  tractable  com¬ 
putational  procedure  for  the  distribution  and  which  illuminates 
its  limiting  distribution. 


Project  &  Task: 


A690-46900 3 


Contract  No.  AF  19(628)  — 1610 


Lb  I  -TDR-64-606 


David  M .  Messick 

SEQUENTIAL  INFORMATION  SEEKING '  EFFECTS  OF  THE  NUMBER  OF  TERMINAL 
ACTS  AND  PRIOR  INFORMATION 


An  experiment  was  conducted  by  means  of  a  digital  computer  in 
which  5**  human  Ss  were  faced  with  the  task  of  sampling  from  a 
hypothetical  binomial  universe  in  which  a  proportion,  p,  of  all 
observations  were  "top  quality."  Ss  sampled  the  universe  sequen¬ 
tially,  stopping  after  some  number  of  observations  had  been  made 
to  make  a  terminal  decision  by  selecting  the  one  of  the  S's 
mutually  exclusive  and  exhaustive  subsets  of  the  unit  interval 
which  S  believed  contained  p.  The  four  experimental  treatments 
were'  defined  by  the  four  combinations  of  the  two  decision  parti¬ 
tions  of  the  unit  interval,  one  involving  3  possible  terminal  acts, 
the  other  having  5  alternatives,  and  the  two  prior  frequency  dis¬ 
tributions,  one  a  rectangular  distribution  over  [0  -  1],  the  other 
being  negatively  skewed.  Analysis  of  variance  of  the  number  of 
predecision  observations  taken  indicated  a)  significant  individual 
differences;  b)  significant  S  by  treatment  interactions;  c) 
differences  attributable  to  the  decision  partitions  with  more  ob¬ 
servations  being  taken  in  the  5-act  case  than  in  the  3-act  case; 
and  d)  no  effect  of  prior  frequency  distributions,  but  a  tendency 
to  take  more  observations  in  the  second  16  trials  than  in  the  first 
16. 


Project  &  Task:  AC90-A69C03  Contract  No.  AF  19(S28)~l6l0 
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ESP-TDR-6*4-60*4 


Amnon  Rapoport 

A  STUDY  OF  HUMAN  DECISIONS  IN  A  COMPUTER-CONTROLLED  TASK  (Published 
as  SEQUENTIAL  DECISION  MAKING  IN  A  COMPUTER-CONTROLLED  TASK  in 
J.  Math.  Psychol.,  Vol.  1,  No.  2,  351-37**,  July,  196*0. 


Twenty-four  undergraduate  students  participated  in  a  computer-con- 
trolled  experimental  task,  intended  as  an  investigation  of  decision¬ 
making  processes  in  a  nonstationary  environment.  The  subjects,  who 
were  tested  individually,  were  instructed  to  .ake  a  sequence  of 
predictions  and  estimates  conditional  upon  typed  information  pro¬ 
vided  by  the  LGP-30  computer.  Specifically,  each  subject  was  re¬ 
quired  to  predict  the  value  of  an  unknown  p,  0  <_  p  <_  1.00,  and  then 
to  estimate  its  values  after  obtaining  partial  information  about 
it  from  the  computer  The  procedure  was  repeated  for  120  values  of 
p,  generated  randomly  by  three  different  beta  distributions.  Sub¬ 
jects  were  paid  immediately  according  to  their  success.  The  pa¬ 
rameters  of  the  subjective  probability  distributions  were  Inferred 
from  the  subject's  predictions  and  estimates  in  a  block  of  ten 
trials.  In  addition,  direct  measures  of  the  subjective  probability 
distributions  were  obtained  for  every  block  from  the  subject's 
verbal  responses  given  to  three  hypothetical  bets  concerning  tne 
value  of  the  next  p. 

An  optimal  model,  derived  from  Bayesian  decision  theory,  was 
tested  and  partially  confirmed  by  the  subjects'  responses.  Al¬ 
ternative  models  were  derived  and  tested,  and  some  theoretical 
implications  concerning  learning  processes  and  the  use  of  compi  - 
ters  in  psychological  experiments  were  discussed  briefly. 
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ESTIMATION  OF  CONTINUOUS  SUBJECTIVE  PROBABILITY  DISTRIBUTION  IN  A 
SEQUENTIAL  DECISION  TASK 


A  range  betting  method  (RBM)  for  measuring  continuous  subjective 
probability  distributions  is  presented.  Predictions  derived  from 
the  method  are  tested  in  a  computer-controlled  sequential  decision 
making  task.  In  particular,  twenty-two  undergraduate  students  par¬ 
ticipated  individually  in  the  task.  The  subjects  were  asked  oc¬ 
casionally  to  place  a  wager  upon  the  range  of  an  unknown  value  of 
P,  0  1.  P  i.  1  >  generated  by  an  LGP-30  computer.  In  addition,  the 
subjects  predicted  the  exact  value  of  P  by  typing  a  number  on  a 
Flexowriter,  and  then  estimated  the  value  of  P  after  obtaining  in¬ 
formation  about  its  value.  The  information  was  given  in  the  form 
of  n  values  ( 1  <_  n  <_  20)  of  0's  or  l's  generated  by  a  binomial  dis¬ 
tribution  with  mean  P  and  printed  on  the  Flexowriter  in  front  of 
the  subject. 

Predicted  decisions  were  derived  both  from  the  RBM  method  and 
from  a  relatively  simple  confidence  limits'  model.  The  subjects' 
estimates  of  the  range  boundaries  were  compared  to  the  predicted 
values  from  both  models.  In  both  comparisons  the  results  indicated 
a  regression  effect  and  a  b lock-to-biock  tracking  behavior.  At¬ 
tempts  to  compare  two  independent  measures  of  the  same  subjective 
probability  distribution  were  unsuccessful. 
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DECISION-THEORETIC  AND  EMPIRICAL  INVESTIGATION  OF  SOME  PROBA¬ 
BILISTIC  DISCRIMINATION  LEARNING  SITUATIONS 


A  decision-theoretic  analysis  and  experiment  of  three  related 
choice  situations  is  presented.  The  first  situation  is  a  standard 
probabilistic  discrimination  learning  task.  Each  task  begins  with 
the  presentation  of  one  of  a  set  of  stimuli.  The  subject  must 
choose  between  two  response  alternatives  to  predict  which  of  the 
two  events  will  occur  on  the  trial,  the  probability  of  each  event 
being  a  function  of  the  stimulus  presented.  The  second  situation 
arises  when  the  conditional  probabilities,  i.e.,  the  probabilities 
of  the  stimuli  given  the  events,  are  introduced  to  the  subject  at 
the  beginning  of  the  experiment.  The  third  situation  Is  like  the 
second  except  for  the  fact  that  the  subject  is  not  told  which 
event  occurs  on  each  trial. 

The  decision-theoretic  analysis  shows  what  differences  In  per¬ 
formance  would  be  expected  among  the  three  conditions  when  a  strate¬ 
gy  which  maximizes  average  expected  payoff  is  employed. 

One  group  of  subjects  was  run  in  each  situation  with  the  over¬ 
all  relative  frequency  of  one  event  equal  to  .80.  The  performance 
of  the  subjects  in  the  first  and  second  situations  was  virtually 
identical,  while  the  performance  of  the  subjects  in  the  third  (non¬ 
feedback)  was  somewhat  worse.  The  performance  measure  was  the  sum 
of  the  differences  between  the  objective  expected  payoff  of  the 
optimal  choices  and  the  choices  made  by  the  subject.  Comparisons 
of  the  choice  proportions  for  the  first  and  second  groups  indica¬ 
ted  that  subjects  in  the  second  group  did  not  Integrate  Informa¬ 
tion  concerning  the  overall  relative  frequencies  of  events  and 
conditional  probabilities.  A  large  proportion  of  subjects  In  the 
third  (non-feedback )  group  made  every  choice  in  agreement  with  the 
assumption  that  the  overall  relative  frequency  of  one  event  was 
one-half. 
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